02 JAN 2011 by ideonexus
Natural Language Processing vs. Semantic Web
NLP works well statistically; the SW, in contrast, requires logic and doesn't yet make substantial use of statistics. Natural language is democratic, as expressed in the slogan 'meaning is use' (see Section 5.1 for more discussion of this). The equivalent in the SW of the words of natural language are logical terms, of which URIs are prominent. Thus we have an immediate disanalogy between NLP and the SW, which is that URIs, unlike words, have owners, and so can be regulated. That is not to sa...A short comparison of the difference between NLP and SW in terms of processing, algorithms, structure, and emergence. NLP is described as 'democratic', where the power of SW is that URIs 'have owners,' meaning they are a top-down construct. Perhaps this is the problem of the Semantic Web and why it may never catch on: the web favors emergent semantics and democratized development.
02 JAN 2011 by ideonexus
Ontologies vs. Folksonomies
It is argued - though currently the arguments are filtering only slowly into the academic literature - that folksonomies are preferable to the use of controlled, centralised ontologies [e.g. 259]. Annotating Web pages using controlled vocabularies will improve the chances of one's page turning up on the 'right' Web searches, but on the other hand the large heterogeneous user base of the Web is unlikely to contain many people (or organisations) willing to adopt or maintain a complex ontology. ...Ontologies provide structure and a standard for tagging and searching, while folksonomies provide for an emergent system for tagging things.
02 JAN 2011 by ideonexus
The Importance of Time-Stamping to Relevance
Time-stamping is of interest because the temporal element of context is essential for understanding a text (to take an obvious example, when reading a paper on global geopolitics in 2006 it is essential to know whether it was written before or after 11th September, 2001). Furthermore, some information has a 'sell-by date': after a certain point it may become unreliable. Often this point isn't predictable exactly, but broad indications can be given; naturally much depends on whether the inform...Time-stamping is a crucial function of semantic data. Some information grows less accurate over time, while a context is desired for other data.